Overview
Brought to you by YData
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 1605 |
| Missing cells | 2671 |
| Missing cells (%) | 7.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 275.9 KiB |
| Average record size in memory | 176.0 B |
Variable types
| Numeric | 19 |
|---|---|
| Text | 2 |
bod_max is highly overall correlated with bod_min and 3 other fields | High correlation |
bod_min is highly overall correlated with bod_max | High correlation |
conductivity_max is highly overall correlated with bod_max and 1 other fields | High correlation |
conductivity_min is highly overall correlated with bod_max and 1 other fields | High correlation |
do_max is highly overall correlated with do_min | High correlation |
do_min is highly overall correlated with bod_max and 1 other fields | High correlation |
fecal_coliform_max is highly overall correlated with fecal_coliform_min and 4 other fields | High correlation |
fecal_coliform_min is highly overall correlated with fecal_coliform_max and 4 other fields | High correlation |
fecal_streptococci_max is highly overall correlated with fecal_coliform_max and 4 other fields | High correlation |
fecal_streptococci_min is highly overall correlated with fecal_coliform_max and 3 other fields | High correlation |
nitrate_max is highly overall correlated with nitrate_min | High correlation |
nitrate_min is highly overall correlated with nitrate_max | High correlation |
ph_max is highly overall correlated with ph_min | High correlation |
ph_min is highly overall correlated with ph_max | High correlation |
total_coliform_max is highly overall correlated with fecal_coliform_max and 3 other fields | High correlation |
total_coliform_min is highly overall correlated with fecal_coliform_max and 4 other fields | High correlation |
state_name has 124 (7.7%) missing values | Missing |
nitrate_min has 56 (3.5%) missing values | Missing |
nitrate_max has 56 (3.5%) missing values | Missing |
fecal_coliform_min has 184 (11.5%) missing values | Missing |
fecal_coliform_max has 185 (11.5%) missing values | Missing |
total_coliform_min has 224 (14.0%) missing values | Missing |
total_coliform_max has 224 (14.0%) missing values | Missing |
fecal_streptococci_min has 766 (47.7%) missing values | Missing |
fecal_streptococci_max has 767 (47.8%) missing values | Missing |
temp_min is highly skewed (γ1 = 31.20516608) | Skewed |
conductivity_min is highly skewed (γ1 = 34.51328681) | Skewed |
bod_min is highly skewed (γ1 = 29.27994682) | Skewed |
bod_max is highly skewed (γ1 = 27.00620253) | Skewed |
nitrate_min is highly skewed (γ1 = 27.50700342) | Skewed |
nitrate_max is highly skewed (γ1 = 35.10507529) | Skewed |
fecal_coliform_min is highly skewed (γ1 = 34.37295455) | Skewed |
fecal_coliform_max is highly skewed (γ1 = 21.59338436) | Skewed |
total_coliform_min is highly skewed (γ1 = 33.49430653) | Skewed |
total_coliform_max is highly skewed (γ1 = 24.02009987) | Skewed |
fecal_streptococci_max is highly skewed (γ1 = 24.80671161) | Skewed |
bod_min has 56 (3.5%) zeros | Zeros |
bod_max has 52 (3.2%) zeros | Zeros |
nitrate_min has 170 (10.6%) zeros | Zeros |
nitrate_max has 122 (7.6%) zeros | Zeros |
Reproduction
| Analysis started | 2025-08-31 04:45:50.735319 |
|---|---|
| Analysis finished | 2025-08-31 04:46:21.641173 |
| Duration | 30.91 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
station_code
Real number (ℝ)
| Distinct | 1604 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4516.0486 |
| Minimum | 1 |
|---|---|
| Maximum | 30089 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1062.2 |
| Q1 | 1819 |
| median | 2951 |
| Q3 | 4415 |
| 95-th percentile | 10154.8 |
| Maximum | 30089 |
| Range | 30088 |
| Interquartile range (IQR) | 2596 |
Descriptive statistics
| Standard deviation | 5948.38 |
|---|---|
| Coefficient of variation (CV) | 1.3171647 |
| Kurtosis | 12.368924 |
| Mean | 4516.0486 |
| Median Absolute Deviation (MAD) | 1348 |
| Skewness | 3.5466915 |
| Sum | 7248258 |
| Variance | 35383224 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4118 | 2 | 0.1% |
| 1276 | 1 | 0.1% |
| 3887 | 1 | 0.1% |
| 2406 | 1 | 0.1% |
| 2404 | 1 | 0.1% |
| 1274 | 1 | 0.1% |
| 1272 | 1 | 0.1% |
| 2405 | 1 | 0.1% |
| 1271 | 1 | 0.1% |
| 1270 | 1 | 0.1% |
| Other values (1594) | 1594 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 7 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 |
| Value | Count | Frequency (%) |
| 30089 | 1 | |
| 30088 | 1 | |
| 30087 | 1 | |
| 30086 | 1 | |
| 30085 | 1 | |
| 30084 | 1 | |
| 30083 | 1 | |
| 30082 | 1 | |
| 30081 | 1 | |
| 30080 | 1 |
| Distinct | 1598 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 6 |
| Missing (%) | 0.4% |
| Memory size | 25.1 KiB |
Length
| Max length | 163 |
|---|---|
| Median length | 114 |
| Mean length | 48.145716 |
| Min length | 8 |
Unique
| Unique | 1597 ? |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | mg/L mg/L Ml 100 mL |
|---|---|
| 2nd row | RIVER BEAS AT U/S MANALI |
| 3rd row | RIVER BEAS AT D/S MANALI |
| 4th row | RIVER BEAS D/S OF WASTE PROCESSING FACILITY AT MANALI |
| 5th row | RIVER BEAS D/S MANALSU NALLAH |
| Value | Count | Frequency (%) |
| river | 1636 | 13.8% |
| at | 1278 | 10.8% |
| d/s | 309 | 2.6% |
| of | 274 | 2.3% |
| near | 259 | 2.2% |
| bridge | 237 | 2.0% |
| u/s | 213 | 1.8% |
| village | 147 | 1.2% |
| ganga | 112 | 0.9% |
| district | 109 | 0.9% |
| Other values (2908) | 7261 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10828 | |
| 10236 | ||
| R | 7509 | 9.8% |
| I | 5252 | 6.8% |
| E | 4528 | 5.9% |
| T | 3782 | 4.9% |
| N | 3664 | 4.8% |
| D | 2650 | 3.4% |
| H | 2630 | 3.4% |
| S | 2434 | 3.2% |
| Other values (53) | 23472 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 76985 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 10828 | |
| 10236 | ||
| R | 7509 | 9.8% |
| I | 5252 | 6.8% |
| E | 4528 | 5.9% |
| T | 3782 | 4.9% |
| N | 3664 | 4.8% |
| D | 2650 | 3.4% |
| H | 2630 | 3.4% |
| S | 2434 | 3.2% |
| Other values (53) | 23472 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 76985 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 10828 | |
| 10236 | ||
| R | 7509 | 9.8% |
| I | 5252 | 6.8% |
| E | 4528 | 5.9% |
| T | 3782 | 4.9% |
| N | 3664 | 4.8% |
| D | 2650 | 3.4% |
| H | 2630 | 3.4% |
| S | 2434 | 3.2% |
| Other values (53) | 23472 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 76985 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 10828 | |
| 10236 | ||
| R | 7509 | 9.8% |
| I | 5252 | 6.8% |
| E | 4528 | 5.9% |
| T | 3782 | 4.9% |
| N | 3664 | 4.8% |
| D | 2650 | 3.4% |
| H | 2630 | 3.4% |
| S | 2434 | 3.2% |
| Other values (53) | 23472 |
state_name
Text
Missing 
| Distinct | 116 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 124 |
| Missing (%) | 7.7% |
| Memory size | 25.1 KiB |
Length
| Max length | 106 |
|---|---|
| Median length | 41 |
| Mean length | 11.548953 |
| Min length | 3 |
Unique
| Unique | 78 ? |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | HIMACHAL PRADESH |
|---|---|
| 2nd row | HIMACHAL PRADESH |
| 3rd row | HIMACHAL PRADESH |
| 4th row | HIMACHAL PRADESH |
| 5th row | HIMACHAL PRADESH |
| Value | Count | Frequency (%) |
| pradesh | 438 | |
| madhya | 152 | 6.3% |
| himachal | 143 | 5.9% |
| odisha | 125 | 5.2% |
| maharashtra | 118 | 4.9% |
| uttar | 113 | 4.7% |
| karnataka | 110 | 4.6% |
| bihar | 98 | 4.1% |
| assam | 92 | 3.8% |
| jharkhand | 60 | 2.5% |
| Other values (133) | 964 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 3787 | |
| H | 1760 | |
| R | 1429 | 8.4% |
| S | 1103 | 6.4% |
| D | 993 | 5.8% |
| 932 | 5.4% | |
| T | 893 | 5.2% |
| M | 867 | 5.1% |
| I | 702 | 4.1% |
| E | 674 | 3.9% |
| Other values (23) | 3964 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17104 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 3787 | |
| H | 1760 | |
| R | 1429 | 8.4% |
| S | 1103 | 6.4% |
| D | 993 | 5.8% |
| 932 | 5.4% | |
| T | 893 | 5.2% |
| M | 867 | 5.1% |
| I | 702 | 4.1% |
| E | 674 | 3.9% |
| Other values (23) | 3964 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17104 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 3787 | |
| H | 1760 | |
| R | 1429 | 8.4% |
| S | 1103 | 6.4% |
| D | 993 | 5.8% |
| 932 | 5.4% | |
| T | 893 | 5.2% |
| M | 867 | 5.1% |
| I | 702 | 4.1% |
| E | 674 | 3.9% |
| Other values (23) | 3964 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17104 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 3787 | |
| H | 1760 | |
| R | 1429 | 8.4% |
| S | 1103 | 6.4% |
| D | 993 | 5.8% |
| 932 | 5.4% | |
| T | 893 | 5.2% |
| M | 867 | 5.1% |
| I | 702 | 4.1% |
| E | 674 | 3.9% |
| Other values (23) | 3964 |
temp_min
Real number (ℝ)
Skewed 
| Distinct | 149 |
|---|---|
| Distinct (%) | 9.3% |
| Missing | 7 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.876471 |
| Minimum | 0.3 |
|---|---|
| Maximum | 3836 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 6.185 |
| Q1 | 15 |
| median | 19 |
| Q3 | 22 |
| 95-th percentile | 27 |
| Maximum | 3836 |
| Range | 3835.7 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 109.11229 |
|---|---|
| Coefficient of variation (CV) | 4.9876553 |
| Kurtosis | 1020.5604 |
| Mean | 21.876471 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 31.205166 |
| Sum | 34958.6 |
| Variance | 11905.493 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 111 | 6.9% |
| 22 | 102 | 6.4% |
| 19 | 101 | 6.3% |
| 18 | 97 | 6.0% |
| 16 | 90 | 5.6% |
| 21 | 81 | 5.0% |
| 15 | 75 | 4.7% |
| 24 | 72 | 4.5% |
| 23 | 68 | 4.2% |
| 25 | 65 | 4.0% |
| Other values (139) | 736 |
| Value | Count | Frequency (%) |
| 0.3 | 6 | |
| 0.9 | 3 | |
| 1.1 | 1 | 0.1% |
| 1.2 | 1 | 0.1% |
| 1.4 | 1 | 0.1% |
| 2 | 5 | |
| 2.4 | 1 | 0.1% |
| 2.6 | 4 | |
| 2.9 | 1 | 0.1% |
| 3 | 7 |
| Value | Count | Frequency (%) |
| 3836 | 1 | 0.1% |
| 2115 | 1 | 0.1% |
| 100 | 1 | 0.1% |
| 29 | 8 | 0.5% |
| 28.9 | 2 | 0.1% |
| 28.7 | 1 | 0.1% |
| 28 | 20 | |
| 27.5 | 4 | 0.2% |
| 27.4 | 1 | 0.1% |
| 27 | 45 |
temp_max
Real number (ℝ)
| Distinct | 164 |
|---|---|
| Distinct (%) | 10.3% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.23325 |
| Minimum | 1.1 |
|---|---|
| Maximum | 39 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 1.1 |
|---|---|
| 5-th percentile | 13.5 |
| Q1 | 24 |
| median | 29 |
| Q3 | 31.1 |
| 95-th percentile | 35 |
| Maximum | 39 |
| Range | 37.9 |
| Interquartile range (IQR) | 7.1 |
Descriptive statistics
| Standard deviation | 6.2328132 |
|---|---|
| Coefficient of variation (CV) | 0.22886777 |
| Kurtosis | 2.0490941 |
| Mean | 27.23325 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -1.2591552 |
| Sum | 43491.5 |
| Variance | 38.84796 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30 | 140 | 8.7% |
| 28 | 126 | 7.9% |
| 32 | 125 | 7.8% |
| 29 | 108 | 6.7% |
| 31 | 106 | 6.6% |
| 33 | 72 | 4.5% |
| 24 | 65 | 4.0% |
| 27 | 64 | 4.0% |
| 22 | 63 | 3.9% |
| 26 | 63 | 3.9% |
| Other values (154) | 665 |
| Value | Count | Frequency (%) |
| 1.1 | 1 | |
| 1.8 | 1 | |
| 2.4 | 1 | |
| 2.5 | 2 | |
| 2.8 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 4.1 | 2 | |
| 4.4 | 1 | |
| 4.7 | 1 |
| Value | Count | Frequency (%) |
| 39 | 4 | 0.2% |
| 38 | 7 | 0.4% |
| 37.9 | 1 | 0.1% |
| 37 | 17 | |
| 36.8 | 2 | 0.1% |
| 36.6 | 1 | 0.1% |
| 36.4 | 1 | 0.1% |
| 36 | 36 | |
| 35.5 | 2 | 0.1% |
| 35 | 29 |
do_min
Real number (ℝ)
High correlation 
| Distinct | 101 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9132123 |
| Minimum | 0.3 |
|---|---|
| Maximum | 28.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 0.98 |
| Q1 | 5.2 |
| median | 6.3 |
| Q3 | 7.1 |
| 95-th percentile | 8.2 |
| Maximum | 28.2 |
| Range | 27.9 |
| Interquartile range (IQR) | 1.9 |
Descriptive statistics
| Standard deviation | 2.1143517 |
|---|---|
| Coefficient of variation (CV) | 0.35756398 |
| Kurtosis | 14.490061 |
| Mean | 5.9132123 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | 0.52050182 |
| Sum | 9443.4 |
| Variance | 4.4704832 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 82 | 5.1% |
| 0.3 | 72 | 4.5% |
| 7.2 | 68 | 4.2% |
| 7.1 | 58 | 3.6% |
| 6 | 56 | 3.5% |
| 6.8 | 49 | 3.1% |
| 6.2 | 49 | 3.1% |
| 6.6 | 47 | 2.9% |
| 6.7 | 46 | 2.9% |
| 5 | 46 | 2.9% |
| Other values (91) | 1024 |
| Value | Count | Frequency (%) |
| 0.3 | 72 | |
| 0.4 | 3 | 0.2% |
| 0.5 | 1 | 0.1% |
| 0.6 | 2 | 0.1% |
| 0.7 | 1 | 0.1% |
| 0.9 | 1 | 0.1% |
| 1 | 4 | 0.2% |
| 1.1 | 4 | 0.2% |
| 1.2 | 2 | 0.1% |
| 1.4 | 3 | 0.2% |
| Value | Count | Frequency (%) |
| 28.2 | 1 | 0.1% |
| 25 | 1 | 0.1% |
| 22 | 1 | 0.1% |
| 11.4 | 1 | 0.1% |
| 11.2 | 3 | |
| 11 | 1 | 0.1% |
| 10.8 | 1 | 0.1% |
| 10.2 | 1 | 0.1% |
| 10 | 3 | |
| 9.8 | 2 |
do_max
Real number (ℝ)
High correlation 
| Distinct | 115 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.9230432 |
| Minimum | 0.3 |
|---|---|
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 4.98 |
| Q1 | 7.1 |
| median | 7.9 |
| Q3 | 9 |
| 95-th percentile | 10.8 |
| Maximum | 28 |
| Range | 27.7 |
| Interquartile range (IQR) | 1.9 |
Descriptive statistics
| Standard deviation | 1.9151637 |
|---|---|
| Coefficient of variation (CV) | 0.24172072 |
| Kurtosis | 10.087097 |
| Mean | 7.9230432 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.088591418 |
| Sum | 12653.1 |
| Variance | 3.6678521 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.8 | 86 | 5.4% |
| 7.6 | 67 | 4.2% |
| 9.2 | 50 | 3.1% |
| 8.2 | 50 | 3.1% |
| 7.9 | 49 | 3.1% |
| 8.4 | 47 | 2.9% |
| 8 | 46 | 2.9% |
| 8.9 | 44 | 2.7% |
| 7.5 | 44 | 2.7% |
| 6.2 | 43 | 2.7% |
| Other values (105) | 1071 |
| Value | Count | Frequency (%) |
| 0.3 | 14 | |
| 0.8 | 1 | 0.1% |
| 1 | 2 | 0.1% |
| 1.1 | 1 | 0.1% |
| 1.2 | 2 | 0.1% |
| 1.4 | 1 | 0.1% |
| 1.5 | 3 | 0.2% |
| 1.7 | 1 | 0.1% |
| 1.9 | 2 | 0.1% |
| 2.1 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 28 | 1 | 0.1% |
| 14.1 | 1 | 0.1% |
| 13.7 | 3 | |
| 13.5 | 1 | 0.1% |
| 13.2 | 1 | 0.1% |
| 13 | 1 | 0.1% |
| 12.7 | 1 | 0.1% |
| 12.6 | 1 | 0.1% |
| 12.5 | 1 | 0.1% |
| 12.4 | 2 |
ph_min
Real number (ℝ)
High correlation 
| Distinct | 77 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.81742 |
| Minimum | 1 |
|---|---|
| Maximum | 755 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6.5 |
| Q1 | 7 |
| median | 7.2 |
| Q3 | 7.5 |
| 95-th percentile | 8 |
| Maximum | 755 |
| Range | 754 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 43.804484 |
|---|---|
| Coefficient of variation (CV) | 3.4175741 |
| Kurtosis | 97.896736 |
| Mean | 12.81742 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 9.1230345 |
| Sum | 20469.42 |
| Variance | 1918.8328 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.2 | 180 | |
| 7.3 | 142 | 8.8% |
| 7.1 | 141 | 8.8% |
| 7.4 | 133 | 8.3% |
| 7 | 118 | 7.4% |
| 7.5 | 112 | 7.0% |
| 6.8 | 96 | 6.0% |
| 7.6 | 85 | 5.3% |
| 6.9 | 83 | 5.2% |
| 7.7 | 75 | 4.7% |
| Other values (67) | 432 |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 2.6 | 1 | 0.1% |
| 2.7 | 3 | |
| 2.8 | 1 | 0.1% |
| 3.1 | 1 | 0.1% |
| 3.4 | 1 | 0.1% |
| 3.6 | 1 | 0.1% |
| 3.7 | 1 | 0.1% |
| 3.8 | 3 |
| Value | Count | Frequency (%) |
| 755 | 1 | |
| 517 | 1 | |
| 410 | 1 | |
| 404 | 1 | |
| 403 | 1 | |
| 389 | 1 | |
| 364 | 1 | |
| 348 | 2 | |
| 346 | 1 | |
| 336 | 1 |
ph_max
Real number (ℝ)
High correlation 
| Distinct | 78 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.992473 |
| Minimum | 3 |
|---|---|
| Maximum | 1878 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 7.3 |
| Q1 | 7.9 |
| median | 8.2 |
| Q3 | 8.4 |
| 95-th percentile | 8.8 |
| Maximum | 1878 |
| Range | 1875 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 104.19924 |
|---|---|
| Coefficient of variation (CV) | 4.9636476 |
| Kurtosis | 114.3272 |
| Mean | 20.992473 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 9.8103028 |
| Sum | 33524.98 |
| Variance | 10857.482 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.2 | 174 | |
| 8.4 | 158 | |
| 8.3 | 152 | 9.5% |
| 8.5 | 142 | 8.8% |
| 8.1 | 136 | 8.5% |
| 8 | 121 | 7.5% |
| 7.9 | 119 | 7.4% |
| 7.8 | 96 | 6.0% |
| 7.6 | 63 | 3.9% |
| 7.5 | 54 | 3.4% |
| Other values (68) | 382 |
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 3.2 | 1 | 0.1% |
| 3.5 | 1 | 0.1% |
| 4.1 | 1 | 0.1% |
| 4.3 | 1 | 0.1% |
| 4.5 | 3 | |
| 4.7 | 1 | 0.1% |
| 4.9 | 1 | 0.1% |
| 5.4 | 1 | 0.1% |
| 5.5 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1878 | 1 | |
| 1265 | 1 | |
| 1080 | 1 | |
| 989 | 1 | |
| 954 | 1 | |
| 953 | 1 | |
| 864 | 1 | |
| 861 | 1 | |
| 808 | 1 | |
| 807 | 1 |
conductivity_min
Real number (ℝ)
High correlation  Skewed 
| Distinct | 569 |
|---|---|
| Distinct (%) | 35.6% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 282.43945 |
| Minimum | 0 |
|---|---|
| Maximum | 34400 |
| Zeros | 7 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.44 |
| Q1 | 115 |
| median | 198 |
| Q3 | 315 |
| 95-th percentile | 761.2 |
| Maximum | 34400 |
| Range | 34400 |
| Interquartile range (IQR) | 200 |
Descriptive statistics
| Standard deviation | 898.77712 |
|---|---|
| Coefficient of variation (CV) | 3.182194 |
| Kurtosis | 1303.908 |
| Mean | 282.43945 |
| Median Absolute Deviation (MAD) | 93 |
| Skewness | 34.513287 |
| Sum | 451055.8 |
| Variance | 807800.32 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 21 | 1.3% |
| 120 | 14 | 0.9% |
| 140 | 13 | 0.8% |
| 190 | 12 | 0.7% |
| 102 | 12 | 0.7% |
| 110 | 11 | 0.7% |
| 128 | 11 | 0.7% |
| 126 | 11 | 0.7% |
| 210 | 11 | 0.7% |
| 152 | 10 | 0.6% |
| Other values (559) | 1471 |
| Value | Count | Frequency (%) |
| 0 | 7 | 0.4% |
| 1 | 21 | |
| 1.1 | 7 | 0.4% |
| 1.2 | 9 | |
| 1.3 | 2 | 0.1% |
| 1.4 | 2 | 0.1% |
| 1.5 | 3 | 0.2% |
| 1.7 | 1 | 0.1% |
| 1.8 | 3 | 0.2% |
| 1.9 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 34400 | 1 | |
| 5908 | 1 | |
| 2612 | 1 | |
| 1769 | 1 | |
| 1656 | 1 | |
| 1567 | 1 | |
| 1434 | 1 | |
| 1413 | 1 | |
| 1384 | 1 | |
| 1360 | 1 |
conductivity_max
Real number (ℝ)
High correlation 
| Distinct | 940 |
|---|---|
| Distinct (%) | 58.9% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1138.2594 |
| Minimum | 0 |
|---|---|
| Maximum | 54200 |
| Zeros | 7 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13.8 |
| Q1 | 245 |
| median | 440 |
| Q3 | 768 |
| 95-th percentile | 2525.6 |
| Maximum | 54200 |
| Range | 54200 |
| Interquartile range (IQR) | 523 |
Descriptive statistics
| Standard deviation | 4001.248 |
|---|---|
| Coefficient of variation (CV) | 3.5152338 |
| Kurtosis | 91.968671 |
| Mean | 1138.2594 |
| Median Absolute Deviation (MAD) | 239 |
| Skewness | 9.0976796 |
| Sum | 1817800.3 |
| Variance | 16009985 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 264 | 8 | 0.5% |
| 280 | 8 | 0.5% |
| 518 | 8 | 0.5% |
| 240 | 8 | 0.5% |
| 2.6 | 7 | 0.4% |
| 0 | 7 | 0.4% |
| 190 | 7 | 0.4% |
| 176 | 7 | 0.4% |
| 178 | 7 | 0.4% |
| 716 | 6 | 0.4% |
| Other values (930) | 1524 | |
| (Missing) | 8 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 7 | |
| 1 | 5 | |
| 1.5 | 1 | 0.1% |
| 1.6 | 2 | 0.1% |
| 1.8 | 1 | 0.1% |
| 1.9 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 2.3 | 2 | 0.1% |
| 2.4 | 4 | |
| 2.5 | 5 |
| Value | Count | Frequency (%) |
| 54200 | 1 | |
| 53650 | 1 | |
| 44100 | 1 | |
| 42600 | 1 | |
| 42210 | 1 | |
| 41900 | 1 | |
| 40570 | 1 | |
| 37450 | 1 | |
| 36540 | 1 | |
| 36200 | 1 |
bod_min
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 108 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 197.5817 |
| Minimum | 0 |
|---|---|
| Maximum | 170000 |
| Zeros | 56 |
| Zeros (%) | 3.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1.1 |
| Q3 | 2 |
| 95-th percentile | 6.22 |
| Maximum | 170000 |
| Range | 170000 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 5214.2161 |
|---|---|
| Coefficient of variation (CV) | 26.390178 |
| Kurtosis | 879.59653 |
| Mean | 197.5817 |
| Median Absolute Deviation (MAD) | 0.1 |
| Skewness | 29.279947 |
| Sum | 315537.97 |
| Variance | 27188050 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 620 | |
| 1.1 | 110 | 6.9% |
| 2 | 102 | 6.4% |
| 1.2 | 89 | 5.5% |
| 3 | 58 | 3.6% |
| 0 | 56 | 3.5% |
| 1.8 | 46 | 2.9% |
| 1.4 | 43 | 2.7% |
| 1.3 | 43 | 2.7% |
| 2.1 | 35 | 2.2% |
| Other values (98) | 395 |
| Value | Count | Frequency (%) |
| 0 | 56 | |
| 0.3 | 7 | 0.4% |
| 0.32 | 1 | 0.1% |
| 0.33 | 1 | 0.1% |
| 0.52 | 1 | 0.1% |
| 0.55 | 1 | 0.1% |
| 0.58 | 1 | 0.1% |
| 0.61 | 1 | 0.1% |
| 0.62 | 2 | 0.1% |
| 0.66 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 170000 | 1 | |
| 120000 | 1 | |
| 9300 | 1 | |
| 6100 | 1 | |
| 5500 | 1 | |
| 572 | 1 | |
| 306 | 1 | |
| 200 | 1 | |
| 180 | 1 | |
| 48 | 1 |
bod_max
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 172 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 8 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 674.1144 |
| Minimum | 0 |
|---|---|
| Maximum | 470000 |
| Zeros | 52 |
| Zeros (%) | 3.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.7 |
| median | 2.6 |
| Q3 | 3.8 |
| 95-th percentile | 21 |
| Maximum | 470000 |
| Range | 470000 |
| Interquartile range (IQR) | 2.1 |
Descriptive statistics
| Standard deviation | 15473.463 |
|---|---|
| Coefficient of variation (CV) | 22.953764 |
| Kurtosis | 757.96073 |
| Mean | 674.1144 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 27.006203 |
| Sum | 1076560.7 |
| Variance | 2.3942805 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 158 | 9.8% |
| 2 | 70 | 4.4% |
| 2.8 | 69 | 4.3% |
| 3 | 69 | 4.3% |
| 2.5 | 64 | 4.0% |
| 2.6 | 64 | 4.0% |
| 2.4 | 60 | 3.7% |
| 0 | 52 | 3.2% |
| 1.9 | 49 | 3.1% |
| 2.9 | 47 | 2.9% |
| Other values (162) | 895 |
| Value | Count | Frequency (%) |
| 0 | 52 | 3.2% |
| 0.3 | 5 | 0.3% |
| 0.32 | 1 | 0.1% |
| 0.62 | 1 | 0.1% |
| 0.75 | 1 | 0.1% |
| 0.78 | 1 | 0.1% |
| 0.88 | 1 | 0.1% |
| 0.89 | 1 | 0.1% |
| 1 | 158 | |
| 1.1 | 19 | 1.2% |
| Value | Count | Frequency (%) |
| 470000 | 1 | |
| 380000 | 1 | |
| 100000 | 1 | |
| 81000 | 1 | |
| 31000 | 1 | |
| 5500 | 1 | |
| 560 | 1 | |
| 294 | 1 | |
| 145 | 1 | |
| 131 | 1 |
nitrate_min
Real number (ℝ)
High correlation  Missing  Skewed  Zeros 
| Distinct | 216 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 56 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 373.63478 |
| Minimum | 0 |
|---|---|
| Maximum | 230000 |
| Zeros | 170 |
| Zeros (%) | 10.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.3 |
| median | 0.32 |
| Q3 | 0.62 |
| 95-th percentile | 2.632 |
| Maximum | 230000 |
| Range | 230000 |
| Interquartile range (IQR) | 0.32 |
Descriptive statistics
| Standard deviation | 6779.7129 |
|---|---|
| Coefficient of variation (CV) | 18.145294 |
| Kurtosis | 873.63978 |
| Mean | 373.63478 |
| Median Absolute Deviation (MAD) | 0.11 |
| Skewness | 27.507003 |
| Sum | 578760.28 |
| Variance | 45964507 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.32 | 389 | |
| 0.3 | 279 | |
| 0 | 170 | 10.6% |
| 0.02 | 42 | 2.6% |
| 0.6 | 39 | 2.4% |
| 0.7 | 32 | 2.0% |
| 2 | 26 | 1.6% |
| 0.4 | 22 | 1.4% |
| 0.5 | 17 | 1.1% |
| 0.33 | 16 | 1.0% |
| Other values (206) | 517 | |
| (Missing) | 56 | 3.5% |
| Value | Count | Frequency (%) |
| 0 | 170 | |
| 0.02 | 42 | 2.6% |
| 0.03 | 1 | 0.1% |
| 0.04 | 1 | 0.1% |
| 0.06 | 1 | 0.1% |
| 0.1 | 1 | 0.1% |
| 0.11 | 1 | 0.1% |
| 0.12 | 3 | 0.2% |
| 0.13 | 1 | 0.1% |
| 0.14 | 3 | 0.2% |
| Value | Count | Frequency (%) |
| 230000 | 1 | 0.1% |
| 70000 | 3 | |
| 45000 | 1 | 0.1% |
| 33000 | 1 | 0.1% |
| 17000 | 1 | 0.1% |
| 13000 | 1 | 0.1% |
| 11000 | 1 | 0.1% |
| 7800 | 1 | 0.1% |
| 2400 | 1 | 0.1% |
| 2300 | 1 | 0.1% |
nitrate_max
Real number (ℝ)
High correlation  Missing  Skewed  Zeros 
| Distinct | 547 |
|---|---|
| Distinct (%) | 35.3% |
| Missing | 56 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13430.24 |
| Minimum | 0 |
|---|---|
| Maximum | 14000000 |
| Zeros | 122 |
| Zeros (%) | 7.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.67 |
| median | 1.43 |
| Q3 | 3.38 |
| 95-th percentile | 27.248 |
| Maximum | 14000000 |
| Range | 14000000 |
| Interquartile range (IQR) | 2.71 |
Descriptive statistics
| Standard deviation | 371927.94 |
|---|---|
| Coefficient of variation (CV) | 27.693321 |
| Kurtosis | 1299.9152 |
| Mean | 13430.24 |
| Median Absolute Deviation (MAD) | 1.1 |
| Skewness | 35.105075 |
| Sum | 20803441 |
| Variance | 1.383304 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 122 | 7.6% |
| 0.3 | 93 | 5.8% |
| 0.32 | 31 | 1.9% |
| 1.4 | 21 | 1.3% |
| 1.2 | 19 | 1.2% |
| 1.3 | 14 | 0.9% |
| 1.6 | 12 | 0.7% |
| 1 | 12 | 0.7% |
| 1.1 | 12 | 0.7% |
| 1.42 | 11 | 0.7% |
| Other values (537) | 1202 | |
| (Missing) | 56 | 3.5% |
| Value | Count | Frequency (%) |
| 0 | 122 | |
| 0.26 | 1 | 0.1% |
| 0.3 | 93 | |
| 0.31 | 5 | 0.3% |
| 0.32 | 31 | 1.9% |
| 0.33 | 9 | 0.6% |
| 0.34 | 6 | 0.4% |
| 0.35 | 2 | 0.1% |
| 0.36 | 4 | 0.2% |
| 0.37 | 5 | 0.3% |
| Value | Count | Frequency (%) |
| 14000000 | 1 | 0.1% |
| 3400000 | 1 | 0.1% |
| 2600000 | 1 | 0.1% |
| 170000 | 2 | |
| 130000 | 1 | 0.1% |
| 94000 | 1 | 0.1% |
| 78000 | 1 | 0.1% |
| 49000 | 2 | |
| 17000 | 1 | 0.1% |
| 4900 | 4 |
fecal_coliform_min
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 186 |
|---|---|
| Distinct (%) | 13.1% |
| Missing | 184 |
| Missing (%) | 11.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4027.5154 |
| Minimum | 0 |
|---|---|
| Maximum | 2200000 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 27 |
| Q3 | 350 |
| 95-th percentile | 6300 |
| Maximum | 2200000 |
| Range | 2200000 |
| Interquartile range (IQR) | 347 |
Descriptive statistics
| Standard deviation | 60217.317 |
|---|---|
| Coefficient of variation (CV) | 14.95148 |
| Kurtosis | 1248.3795 |
| Mean | 4027.5154 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 34.372955 |
| Sum | 5723099.4 |
| Variance | 3.6261252 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 353 | |
| 4 | 56 | 3.5% |
| 130 | 34 | 2.1% |
| 7 | 33 | 2.1% |
| 1300 | 31 | 1.9% |
| 110 | 29 | 1.8% |
| 360 | 29 | 1.8% |
| 6 | 28 | 1.7% |
| 11 | 26 | 1.6% |
| 78 | 23 | 1.4% |
| Other values (176) | 779 | |
| (Missing) | 184 | 11.5% |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.1% |
| 1.42 | 1 | 0.1% |
| 2 | 353 | |
| 3 | 10 | 0.6% |
| 4 | 56 | 3.5% |
| 5 | 8 | 0.5% |
| 6 | 28 | 1.7% |
| 7 | 33 | 2.1% |
| 8 | 19 | 1.2% |
| 9 | 19 | 1.2% |
| Value | Count | Frequency (%) |
| 2200000 | 1 | 0.1% |
| 310000 | 1 | 0.1% |
| 170000 | 2 | |
| 130000 | 3 | |
| 110000 | 4 | |
| 100000 | 2 | |
| 94000 | 1 | 0.1% |
| 82000 | 1 | 0.1% |
| 79000 | 1 | 0.1% |
| 68000 | 1 | 0.1% |
fecal_coliform_max
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 251 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 185 |
| Missing (%) | 11.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79086.489 |
| Minimum | 2 |
|---|---|
| Maximum | 24000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 30 |
| median | 210 |
| Q3 | 2300 |
| 95-th percentile | 92000 |
| Maximum | 24000000 |
| Range | 23999998 |
| Interquartile range (IQR) | 2270 |
Descriptive statistics
| Standard deviation | 919614.45 |
|---|---|
| Coefficient of variation (CV) | 11.627959 |
| Kurtosis | 516.19575 |
| Mean | 79086.489 |
| Median Absolute Deviation (MAD) | 208 |
| Skewness | 21.593384 |
| Sum | 1.1230281 × 108 |
| Variance | 8.4569073 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 114 | 7.1% |
| 92000 | 57 | 3.6% |
| 6 | 35 | 2.2% |
| 170 | 33 | 2.1% |
| 1100 | 31 | 1.9% |
| 4 | 27 | 1.7% |
| 110 | 25 | 1.6% |
| 2300 | 24 | 1.5% |
| 920 | 24 | 1.5% |
| 1600 | 23 | 1.4% |
| Other values (241) | 1027 | |
| (Missing) | 185 | 11.5% |
| Value | Count | Frequency (%) |
| 2 | 114 | |
| 3 | 2 | 0.1% |
| 4 | 27 | 1.7% |
| 5 | 5 | 0.3% |
| 6 | 35 | 2.2% |
| 7 | 10 | 0.6% |
| 8 | 15 | 0.9% |
| 9 | 13 | 0.8% |
| 10 | 12 | 0.7% |
| 11 | 8 | 0.5% |
| Value | Count | Frequency (%) |
| 24000000 | 1 | 0.1% |
| 21000000 | 1 | 0.1% |
| 7000000 | 2 | |
| 4800000 | 1 | 0.1% |
| 4100000 | 1 | 0.1% |
| 4000000 | 1 | 0.1% |
| 2700000 | 1 | 0.1% |
| 2200000 | 2 | |
| 1700000 | 3 | |
| 1400000 | 1 | 0.1% |
total_coliform_min
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 206 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 224 |
| Missing (%) | 14.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6314.8479 |
| Minimum | 2 |
|---|---|
| Maximum | 3200000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 33 |
| median | 170 |
| Q3 | 1100 |
| 95-th percentile | 11000 |
| Maximum | 3200000 |
| Range | 3199998 |
| Interquartile range (IQR) | 1067 |
Descriptive statistics
| Standard deviation | 89247.465 |
|---|---|
| Coefficient of variation (CV) | 14.132956 |
| Kurtosis | 1191.541 |
| Mean | 6314.8479 |
| Median Absolute Deviation (MAD) | 160 |
| Skewness | 33.494307 |
| Sum | 8720805 |
| Variance | 7.9651101 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 76 | 4.7% |
| 17 | 51 | 3.2% |
| 110 | 43 | 2.7% |
| 1100 | 36 | 2.2% |
| 170 | 31 | 1.9% |
| 350 | 31 | 1.9% |
| 330 | 29 | 1.8% |
| 21 | 27 | 1.7% |
| 490 | 26 | 1.6% |
| 3300 | 25 | 1.6% |
| Other values (196) | 1006 | |
| (Missing) | 224 | 14.0% |
| Value | Count | Frequency (%) |
| 2 | 76 | |
| 3 | 1 | 0.1% |
| 4 | 8 | 0.5% |
| 5 | 4 | 0.2% |
| 6 | 1 | 0.1% |
| 7 | 4 | 0.2% |
| 8 | 4 | 0.2% |
| 9 | 5 | 0.3% |
| 10 | 7 | 0.4% |
| 11 | 12 | 0.7% |
| Value | Count | Frequency (%) |
| 3200000 | 1 | 0.1% |
| 540000 | 1 | 0.1% |
| 270000 | 1 | 0.1% |
| 220000 | 3 | |
| 210000 | 2 | |
| 170000 | 1 | 0.1% |
| 150000 | 1 | 0.1% |
| 140000 | 2 | |
| 130000 | 1 | 0.1% |
| 120000 | 1 | 0.1% |
total_coliform_max
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 254 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 224 |
| Missing (%) | 14.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 360842.84 |
| Minimum | 2 |
|---|---|
| Maximum | 1.6 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 41 |
| Q1 | 170 |
| median | 1300 |
| Q3 | 4900 |
| 95-th percentile | 160000 |
| Maximum | 1.6 × 108 |
| Range | 1.6 × 108 |
| Interquartile range (IQR) | 4730 |
Descriptive statistics
| Standard deviation | 5346977.8 |
|---|---|
| Coefficient of variation (CV) | 14.818024 |
| Kurtosis | 647.27872 |
| Mean | 360842.84 |
| Median Absolute Deviation (MAD) | 1230 |
| Skewness | 24.0201 |
| Sum | 4.9832396 × 108 |
| Variance | 2.8590171 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1600 | 79 | 4.9% |
| 160000 | 79 | 4.9% |
| 350 | 49 | 3.1% |
| 4900 | 42 | 2.6% |
| 49 | 38 | 2.4% |
| 280 | 35 | 2.2% |
| 92000 | 29 | 1.8% |
| 170 | 28 | 1.7% |
| 920 | 26 | 1.6% |
| 2000 | 25 | 1.6% |
| Other values (244) | 951 | |
| (Missing) | 224 | 14.0% |
| Value | Count | Frequency (%) |
| 2 | 18 | |
| 4 | 5 | 0.3% |
| 6 | 1 | 0.1% |
| 16 | 1 | 0.1% |
| 17 | 1 | 0.1% |
| 20 | 2 | 0.1% |
| 21 | 1 | 0.1% |
| 22 | 1 | 0.1% |
| 23 | 1 | 0.1% |
| 24 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 160000000 | 1 | |
| 92000000 | 1 | |
| 54000000 | 1 | |
| 35000000 | 1 | |
| 28000000 | 1 | |
| 14000000 | 1 | |
| 9400000 | 1 | |
| 9200000 | 1 | |
| 7900000 | 1 | |
| 6300000 | 1 |
fecal_streptococci_min
Real number (ℝ)
High correlation  Missing 
| Distinct | 75 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 766 |
| Missing (%) | 47.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 115.96377 |
| Minimum | 1.8 |
|---|---|
| Maximum | 17000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 1.8 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 2 |
| Q3 | 17 |
| 95-th percentile | 240 |
| Maximum | 17000 |
| Range | 16998.2 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 772.46797 |
|---|---|
| Coefficient of variation (CV) | 6.6612873 |
| Kurtosis | 293.4359 |
| Mean | 115.96377 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.269753 |
| Sum | 97293.6 |
| Variance | 596706.76 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 518 | |
| 4 | 29 | 1.8% |
| 17 | 24 | 1.5% |
| 120 | 23 | 1.4% |
| 210 | 18 | 1.1% |
| 14 | 13 | 0.8% |
| 240 | 12 | 0.7% |
| 6 | 12 | 0.7% |
| 110 | 11 | 0.7% |
| 5 | 11 | 0.7% |
| Other values (65) | 168 | 10.5% |
| (Missing) | 766 |
| Value | Count | Frequency (%) |
| 1.8 | 1 | 0.1% |
| 2 | 518 | |
| 2.8 | 1 | 0.1% |
| 3 | 2 | 0.1% |
| 4 | 29 | 1.8% |
| 5 | 11 | 0.7% |
| 6 | 12 | 0.7% |
| 7 | 9 | 0.6% |
| 8 | 4 | 0.2% |
| 9 | 5 | 0.3% |
| Value | Count | Frequency (%) |
| 17000 | 1 | |
| 7900 | 1 | |
| 6300 | 1 | |
| 4900 | 1 | |
| 4600 | 1 | |
| 4100 | 1 | |
| 3400 | 1 | |
| 2700 | 1 | |
| 2400 | 1 | |
| 2000 | 1 |
fecal_streptococci_max
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 108 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 767 |
| Missing (%) | 47.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1375.8229 |
| Minimum | 2 |
|---|---|
| Maximum | 540000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 6 |
| Q3 | 220 |
| 95-th percentile | 1430 |
| Maximum | 540000 |
| Range | 539998 |
| Interquartile range (IQR) | 218 |
Descriptive statistics
| Standard deviation | 19792.548 |
|---|---|
| Coefficient of variation (CV) | 14.385971 |
| Kurtosis | 661.27308 |
| Mean | 1375.8229 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 24.806712 |
| Sum | 1152939.6 |
| Variance | 3.9174496 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 364 | |
| 290 | 32 | 2.0% |
| 540 | 22 | 1.4% |
| 4 | 21 | 1.3% |
| 490 | 19 | 1.2% |
| 240 | 18 | 1.1% |
| 6 | 15 | 0.9% |
| 17 | 14 | 0.9% |
| 3 | 14 | 0.9% |
| 20 | 12 | 0.7% |
| Other values (98) | 307 | |
| (Missing) | 767 |
| Value | Count | Frequency (%) |
| 2 | 364 | |
| 2.6 | 1 | 0.1% |
| 3 | 14 | 0.9% |
| 4 | 21 | 1.3% |
| 5 | 8 | 0.5% |
| 6 | 15 | 0.9% |
| 7 | 5 | 0.3% |
| 8 | 3 | 0.2% |
| 9 | 4 | 0.2% |
| 11 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 540000 | 1 | |
| 130000 | 2 | |
| 35000 | 2 | |
| 17000 | 1 | |
| 16000 | 1 | |
| 13000 | 1 | |
| 11000 | 1 | |
| 9800 | 1 | |
| 9400 | 2 | |
| 9200 | 1 |
Interactions
Correlations
| bod_max | bod_min | conductivity_max | conductivity_min | do_max | do_min | fecal_coliform_max | fecal_coliform_min | fecal_streptococci_max | fecal_streptococci_min | nitrate_max | nitrate_min | ph_max | ph_min | station_code | temp_max | temp_min | total_coliform_max | total_coliform_min | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| bod_max | 1.000 | 0.698 | 0.537 | 0.544 | -0.390 | -0.558 | 0.278 | 0.276 | 0.344 | 0.332 | 0.154 | -0.013 | 0.047 | 0.021 | -0.060 | 0.245 | 0.260 | 0.284 | 0.382 |
| bod_min | 0.698 | 1.000 | 0.365 | 0.422 | -0.404 | -0.418 | 0.150 | 0.228 | 0.280 | 0.373 | 0.129 | 0.043 | -0.153 | -0.063 | -0.108 | 0.141 | 0.287 | 0.163 | 0.345 |
| conductivity_max | 0.537 | 0.365 | 1.000 | 0.797 | -0.336 | -0.426 | -0.078 | -0.115 | -0.065 | -0.056 | 0.229 | 0.002 | 0.236 | 0.144 | -0.064 | 0.263 | 0.382 | -0.033 | -0.012 |
| conductivity_min | 0.544 | 0.422 | 0.797 | 1.000 | -0.355 | -0.402 | -0.060 | -0.029 | 0.018 | 0.051 | 0.168 | 0.090 | 0.192 | 0.246 | -0.050 | 0.252 | 0.377 | -0.050 | 0.054 |
| do_max | -0.390 | -0.404 | -0.336 | -0.355 | 1.000 | 0.562 | 0.175 | 0.058 | -0.044 | -0.159 | -0.264 | -0.122 | 0.159 | 0.042 | 0.066 | -0.100 | -0.343 | 0.193 | 0.034 |
| do_min | -0.558 | -0.418 | -0.426 | -0.402 | 0.562 | 1.000 | -0.216 | -0.217 | -0.326 | -0.364 | -0.178 | -0.008 | 0.089 | 0.180 | 0.077 | -0.310 | -0.337 | -0.209 | -0.292 |
| fecal_coliform_max | 0.278 | 0.150 | -0.078 | -0.060 | 0.175 | -0.216 | 1.000 | 0.828 | 0.750 | 0.524 | -0.160 | -0.106 | -0.018 | -0.070 | 0.108 | 0.324 | -0.108 | 0.921 | 0.793 |
| fecal_coliform_min | 0.276 | 0.228 | -0.115 | -0.029 | 0.058 | -0.217 | 0.828 | 1.000 | 0.752 | 0.658 | -0.185 | -0.047 | -0.028 | -0.040 | 0.031 | 0.277 | -0.106 | 0.704 | 0.873 |
| fecal_streptococci_max | 0.344 | 0.280 | -0.065 | 0.018 | -0.044 | -0.326 | 0.750 | 0.752 | 1.000 | 0.757 | -0.090 | 0.036 | -0.134 | -0.232 | 0.002 | 0.434 | 0.208 | 0.656 | 0.754 |
| fecal_streptococci_min | 0.332 | 0.373 | -0.056 | 0.051 | -0.159 | -0.364 | 0.524 | 0.658 | 0.757 | 1.000 | 0.093 | 0.232 | -0.174 | -0.187 | -0.029 | 0.286 | 0.222 | 0.431 | 0.650 |
| nitrate_max | 0.154 | 0.129 | 0.229 | 0.168 | -0.264 | -0.178 | -0.160 | -0.185 | -0.090 | 0.093 | 1.000 | 0.580 | 0.122 | -0.009 | -0.027 | -0.124 | 0.131 | -0.168 | -0.200 |
| nitrate_min | -0.013 | 0.043 | 0.002 | 0.090 | -0.122 | -0.008 | -0.106 | -0.047 | 0.036 | 0.232 | 0.580 | 1.000 | 0.019 | 0.036 | 0.032 | -0.136 | 0.057 | -0.178 | -0.116 |
| ph_max | 0.047 | -0.153 | 0.236 | 0.192 | 0.159 | 0.089 | -0.018 | -0.028 | -0.134 | -0.174 | 0.122 | 0.019 | 1.000 | 0.520 | -0.057 | 0.003 | 0.031 | -0.025 | -0.086 |
| ph_min | 0.021 | -0.063 | 0.144 | 0.246 | 0.042 | 0.180 | -0.070 | -0.040 | -0.232 | -0.187 | -0.009 | 0.036 | 0.520 | 1.000 | 0.043 | -0.102 | 0.002 | -0.104 | -0.105 |
| station_code | -0.060 | -0.108 | -0.064 | -0.050 | 0.066 | 0.077 | 0.108 | 0.031 | 0.002 | -0.029 | -0.027 | 0.032 | -0.057 | 0.043 | 1.000 | -0.106 | -0.086 | 0.105 | 0.018 |
| temp_max | 0.245 | 0.141 | 0.263 | 0.252 | -0.100 | -0.310 | 0.324 | 0.277 | 0.434 | 0.286 | -0.124 | -0.136 | 0.003 | -0.102 | -0.106 | 1.000 | 0.427 | 0.321 | 0.325 |
| temp_min | 0.260 | 0.287 | 0.382 | 0.377 | -0.343 | -0.337 | -0.108 | -0.106 | 0.208 | 0.222 | 0.131 | 0.057 | 0.031 | 0.002 | -0.086 | 0.427 | 1.000 | -0.064 | 0.020 |
| total_coliform_max | 0.284 | 0.163 | -0.033 | -0.050 | 0.193 | -0.209 | 0.921 | 0.704 | 0.656 | 0.431 | -0.168 | -0.178 | -0.025 | -0.104 | 0.105 | 0.321 | -0.064 | 1.000 | 0.775 |
| total_coliform_min | 0.382 | 0.345 | -0.012 | 0.054 | 0.034 | -0.292 | 0.793 | 0.873 | 0.754 | 0.650 | -0.200 | -0.116 | -0.086 | -0.105 | 0.018 | 0.325 | 0.020 | 0.775 | 1.000 |
Missing values
Sample
| station_code | monitoring_location | state_name | temp_min | temp_max | do_min | do_max | ph_min | ph_max | conductivity_min | conductivity_max | bod_min | bod_max | nitrate_min | nitrate_max | fecal_coliform_min | fecal_coliform_max | total_coliform_min | total_coliform_max | fecal_streptococci_min | fecal_streptococci_max | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1986 | mg/L mg/L Ml 100 mL | NaN | 100.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 1001 | RIVER BEAS AT U/S MANALI | HIMACHAL PRADESH | 2.0 | 24.0 | 7.8 | 9.2 | 7.2 | 8.2 | 68.0 | 380.0 | 1.0 | 2.8 | 0.32 | 1.15 | 2.0 | 170.0 | 63.0 | 540.0 | 2.0 | 2.0 |
| 2 | 2601 | RIVER BEAS AT D/S MANALI | HIMACHAL PRADESH | 2.0 | 13.0 | 7.6 | 9.0 | 6.5 | 8.1 | 58.0 | 135.0 | 1.0 | 2.8 | 0.32 | 1.87 | 110.0 | 1600.0 | 920.0 | 1600.0 | 2.0 | 2.0 |
| 3 | 4444 | RIVER BEAS D/S OF WASTE PROCESSING FACILITY AT MANALI | HIMACHAL PRADESH | 2.0 | 13.0 | 7.8 | 8.8 | 6.7 | 7.8 | 62.0 | 113.0 | 1.0 | 2.8 | 0.32 | 1.08 | 110.0 | 1600.0 | 350.0 | 1600.0 | 2.0 | 2.0 |
| 4 | 4037 | RIVER BEAS D/S MANALSU NALLAH | HIMACHAL PRADESH | 2.0 | 14.0 | 7.9 | 8.9 | 6.3 | 8.0 | 52.0 | 137.0 | 1.0 | 1.0 | 0.32 | 1.74 | 22.0 | 110.0 | 79.0 | 540.0 | 2.0 | 2.0 |
| 5 | 3866 | RIVER BEAS U/S BEFORE CONF. OF MANALSU NALLAH | HIMACHAL PRADESH | 2.0 | 13.0 | 7.8 | 9.1 | 7.0 | 7.8 | 51.0 | 113.0 | 1.0 | 1.0 | 0.32 | 0.97 | 23.0 | 120.0 | 110.0 | 430.0 | 2.0 | 2.0 |
| 6 | 2602 | RIVER BEAS, U/S KULLU | HIMACHAL PRADESH | 4.0 | 16.0 | 7.6 | 8.7 | 6.7 | 7.8 | 44.0 | 128.0 | 1.0 | 1.0 | 0.32 | 0.90 | 49.0 | 220.0 | 240.0 | 920.0 | 2.0 | 2.0 |
| 7 | 4445 | RIVER BEAS D/S OF WASTE PROCESSING FACILITY AT KULLU | HIMACHAL PRADESH | 4.0 | 16.0 | 7.6 | 8.4 | 6.6 | 7.7 | 73.0 | 170.0 | 1.0 | 1.6 | 0.32 | 0.96 | 47.0 | 150.0 | 280.0 | 920.0 | 2.0 | 2.0 |
| 8 | 1002 | RIVER BEAS D/S KULLU | HIMACHAL PRADESH | 4.0 | 16.0 | 7.5 | 8.4 | 7.2 | 8.0 | 77.0 | 144.0 | 1.0 | 1.6 | 0.32 | 2.65 | 110.0 | 540.0 | 540.0 | 1600.0 | 2.0 | 2.0 |
| 9 | 1003 | RIVER BEAS D/S AUT | HIMACHAL PRADESH | 5.0 | 16.0 | 7.5 | 8.1 | 7.2 | 8.2 | 61.0 | 129.0 | 1.0 | 1.0 | 0.32 | 0.77 | 34.0 | 280.0 | 170.0 | 1600.0 | 2.0 | 2.0 |
| station_code | monitoring_location | state_name | temp_min | temp_max | do_min | do_max | ph_min | ph_max | conductivity_min | conductivity_max | bod_min | bod_max | nitrate_min | nitrate_max | fecal_coliform_min | fecal_coliform_max | total_coliform_min | total_coliform_max | fecal_streptococci_min | fecal_streptococci_max | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1601 | 30068 | RIVER SUBARNAREKHA AT GOPIBALLAVPUR (WEST BENGAL) | WEST BENGAL | 25.0 | 35.0 | 6.8 | 12.5 | 7.0 | 8.2 | 153.0 | 393.0 | 1.0 | 1.0 | 0.32 | 0.37 | 110.00 | 2200000.0 | 1400.0 | 35000000.0 | 2.0 | 78.0 |
| 1602 | 30069 | RIVER SUBARNAREKHA AT LAKHANNATH (ORISSA) | ODISHA | 24.0 | 31.0 | 7.2 | 9.2 | 7.0 | 8.5 | 152.0 | 334.0 | 1.0 | 1.0 | 0.32 | 0.32 | 170.00 | 1100000.0 | 1700.0 | 28000000.0 | 2.0 | 78.0 |
| 1603 | 4086 | RIVER SUBARNAREKHA U/S HINDALCO IND LTD, MURLI WORKS, CHHOTAMURI, RANCHI | JHARKHAND | 15.0 | 27.0 | 7.4 | 7.9 | 6.7 | 7.4 | 2.0 | 2.8 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1604 | 4756 | RIVER SUBARNAREKHA AT RUCCA DAM, RUCCA, RANCHI | JHARKHAND | 8.6 | 23.0 | 7.6 | 8.0 | 7.2 | 7.3 | 1.7 | 3.1 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1605 | 2423 | RIVER BUDHABALANGA AT D/S OF BARIPADA TOWN | ODISHA | 20.0 | 30.0 | 5.2 | 9.2 | 7.2 | 8.4 | 146.0 | 326.0 | 1.3 | 2.2 | 0.00 | 0.87 | 780.00 | 4900.0 | 2800.0 | 17000.0 | 5.0 | 49.0 |
| 1606 | 3943 | RIVER BUDHABALANGA AT BALASORE U/S | ODISHA | 21.0 | 30.0 | 5.6 | 9.6 | 6.9 | 8.5 | 125.0 | 383.0 | 1.1 | 2.4 | 0.32 | 0.86 | 230.00 | 2300.0 | 1300.0 | 4900.0 | NaN | NaN |
| 1607 | 3942 | RIVER SONO AT KANAKDURGA ROAD NEAR REMUNA, HATOGOND | ODISHA | 20.0 | 30.0 | 3.2 | 9.6 | 6.9 | 8.5 | 120.0 | 433.0 | 1.1 | 1.8 | 0.32 | 0.93 | 78.00 | 13000.0 | 490.0 | 35000.0 | NaN | NaN |
| 1608 | 2115 | 20.0 22.0 6.6 7.7 7.7 8.6 202 572 1.0 1.8 0.57 1.42 2 2 37 63 | NaN | 2115.0 | 20.0 | 22.0 | 6.6 | 7.7 | 7.7 | 8.6 | 202.0 | 572.0 | 1.0 | 1.80 | 0.57 | 1.42 | 2.0 | 2.0 | 37.0 | 63.0 | NaN |
| 1609 | 4753 | RIVER HARMU NEAR HARMU BRIDGE, HARMU, RANCHI | JHARKHAND | 11.0 | 24.0 | 2.9 | 3.6 | 6.5 | 6.6 | 2.5 | 11.7 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1610 | 4754 | RIVER HARMU BEFORE METTING TO SWARNREKHA RIVER | JHARKHAND | 11.4 | 25.0 | 3.1 | 4.8 | 6.5 | 6.7 | 4.5 | 11.1 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |